1/1 


AD-A157  450  A  SURVEV  OF  SOME  RECENT  RESULTS  IN  CONTINUOUS 

UNIVARIATE  DISTRIBUTIONS^ >  MARVLAND  UNIV  COLLEGE  PARK 
DEPT  OF  MANAGEMENT  SCIENCES  AND  ST.  .  S  K0T2  ET  AL. 
UNCLASSIFIED  JUL  85  UMD-DMSS-85/7  N00014-84-K-0301  F/G  12/1  NL 


AD-A157  450 


•  l..» 11  j,  m . m  im .. .  ^ U 


’wi 


-2. 


A  SURVEY  OF  SOME  RECENT  RESULTS  IN  CONTINUOUS  UNIVARIATE  DISTRIBUTIONS 
(prepared  for  presentation  at  the  Centenary  Session  of  the 
International  Statistical  Institute,  Amsterdam,  Netherlands, 

August  12-22,  1985) 


Samuel  Kotz 

University  of  Maryland 
College  Park 
Maryland,  U.S.A. 


Norman  L.  Johnson 
University  of  North  Carolina 
Chapel  Hill 


North  Carolina,  U.S. 


July  1985 


% 


ABSTRACT: 

A  description  of  a  survey  of  developments  in  continuous  univariate 
distributions  -  theory  and  applications  -  since  1970.  As  an  example,  a 
section  on  inverse  Gaussian  distribution  (and  generalized  inverse  Gaussian 
distribution)  is  appended. 

Key  Words  and  Phrases:  Continuous  univariate  distributions.  Inference, 
Generalized  inverse  Gaussian  distribution. 
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1.  INTRODUCTION 

Pursuant  to  our  work  on  advances  in  discrete  distributions  (Johnson 
and  Kotz  (1982))  we  have  commenced  work  on  advances  in  continuous 
univariate  distributions  from  1971  onwards.  In  contrast  to  the  situation 
for  discrete  distributions,  this  field  was  markedly  more  fully  developed  by 
1970,  in  regard  to  both  variety  of  available  distributions  and 
sophistication  of  methods  for  fitting.  Possibly  as  a  consequence,  there 
is  new  available  a  number  of  very  useful  survey  articles  for  particular 
families  of  distributions.  We  will  make  reference  to  these  at  appropriate 
points,  indicating  the  nature  of  their  contents.  The  interested  reader  is 
recaimended  to  study  these  sources  for  more  precise  information. 

While  taking  advantage  of  the  existence  of  these  valuable  summaries  to 
reduce  the  volume  of  our  own  work,  we  believe  we  should  include  enough 
descriptive  material  to  provide  an  adequate  picture  of  the  current 
situation.  In  our  discussion  of  distributions  already  in  use  before  1971, 
our  discussion  will  be  mostly  of  methods  of  fitting  (including  the  closely 
related  topic  of  estimation  of  parameters),  though  there  will  often  be 
examples  of  new  applications  and  occasionally  seme  new  properties  to 
report.  In  regard  to  distributions  which  have  ocme  into  use  since  1971  we 
will,  in  addition,  endeavor  to  provide  seme  indication  of  the  genesis  of 
the  distribution,  its  properties  and  relations  to  other  distributions. 


2.  Sane  General  Observations 

Two  general  methods  of  estimation  have  appeared  especially  aimed  at 
fitting  distributions  with  a  'threshold  value'  -  that  is  a  value  (unknown) 
below  which  the  density  function  is  zero,  but  above  which  this  is  not  so  - 
such  as  Wei bull,  exponential,  inverse  Gaussian. 

(a)  Cohen  &  Whitten  (1982,  1985)  have  suggested  modification  of 
itaximun  likelihood  estimation  by  replacing  the  equation  putting  the 
derivative  of  likelihood  with  respect  to  the  threshold  parameter  equal  to 
zero  by  one  equating  the  observed  and  expected  values  of  the  first  carder 
statistic  (the  least  observation).  They  also  suggest  a  similar 
modification  to  the  method  of  moments,  wherein  the  equation  relating  third 
sample  and  population  moments  is  replaced.  The  modified  method  gives 
results  comparable  with  full  maximum  likelihood  in  the  first  case,  and 
often  results  in  substantial  improvement  in  the  second. 

(b)  Cheng  and  Amin  (1983)  have  proposed  a  method  of  maximum  product 

of  spacings  (MPS),  in  which  the  product  of  the  'spacings'  Pr[X  <  |9] 

-  Pr[X  <  (i=l,2,...,n+l)  is  maximized  with  respect  to  0.  (X(-^  < 

X(2)  —  —  X(m)  are  or<^er  statistics  for  a  random  sanple  of  size  n;  X^0)  = 

-Q°,X(nfl)  “  +CD*  >  This  method  can  be  used  for  any  continuous  univariate 
distribution,  but  is  particularly  useful  or  distributions  with  threshold 
value(s).  The  authors  claim  that 

(i)  MPS  estimation  gives  consistent  estimators  under  more  general 
conditions  than  maximum  likelihood  estimation. 
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(ii)  MPS  estimators  are  asymptotically  normal  and  asympotically  as 
efficient  as  maximum  likelihood  estimators  vrtien  these  exist.  Ranneby 
(1984)  gives  a  detailed  theoretical  treatment  of  MPS  estimation. 

A  noteworthy  feature  of  recent  years  has  been  the  attention  devoted  to 
simulation  techniques,  in  particular  in  assessing  properties  of  estimators. 

One  might  think  that,  since  any  non-singular  continuous  distribution  can  be 
derived  from  any  other  by  an  appropriate  monotonic  transformation  of  the 
variable,  it  would  suffice  to  have  a  method  for  generating  values 
corresponding  to  a  single  distribution  -  for  example,  a  uniform 
distribution,  which  would  be  closely  approximated  by  values  from  standard 
tables  of  randan  numbers.  It  is  true  that  same  programs  do,  in  effect,  use 
such  an  approach,  but  others  are  more  sophisticated  and  exploit  specific 
features  of  particular  distributions. 

There  have  been  sane  studies  catparing  different  distributions.  These 
are  necessarily  of  a  somewhat  empirical  nature.  This  is  by  no  means  a 
empirical  term  -  the  use  of  criteria  such  as  average  absolute  density 
difference  is  not  often  informative  for  practical  purposes.  When  there  is 
really  close  mimicry  among  different  distributions,  there  is  very  strong 
inducement  to  use  the  one  leading  to  simpler  calculations.  If,  in  fact, 
vise  of  one,  rather  than  another,  of  two  closely-agreeing  distributions, 
makes  any  substantial  difference  in  an  inference  procedure,  it  is  the 
procedure  itself  (cr  associated  theory)  which  should  be  regarded  as  of 
doubtful  utility.  On  the  more  theoretical  side  there  has  been  work  on 
conditions  under  which  distributions  are  ' comparable '  (e.g.  Lisek  (1978)) 


but  this  seems  to  hinge  on  assumptions  that  one  ’ought’  to  be  able  to 
summarize  comparisons  in  terms  of  a  few  indices  (usually  one). 

The  above  remarks  should  not  be  taken  as  disparagement  of 
construction  and  investigation  of  new  distributions.  Indeed,  the  fact  that  it 
is  often  the  case  that  it  is  found  that  a  rather  complicated  distribution, 
derived  from  a  carefully  constructed  model,  can  be  adequately  represented 
by  a  much  simpler  distribution  is  the  basis  for  substantial  advances  in 
applicability.  But  the  more  complicated  distribution  has  to  be  studied 
first  in  order  to  establish  vh ether  there  is  a  dose  relationship.  Also, 
from  a  general  theoretical  point  of  view,  studies  of  systems  of 
distributions  can  assist  greatly  in  attaining  a  broad  grasp  of  the  subject. 

Space  does  not  permit  presentation  of  even  one  example  of  the  detailed 
treatment  we  wish  to  use.  Section  3,  appended  here,  is  such  an  example, 
shewing  a  discussion  of  inverse  Gaussian  distribution  and  including 
discussion  of  generalized  inverse  Gaussian  distributions. 

APPENDIX 

3.  INVERSE  GAUSSIAN  (IG)  DISTRIBUTION 


for  the  PDF.  The  corresponding  CDF  is 


Fx(x|m,9) 


$ 


(x-i4>ytr 
.  0-/5T"  , 


+  e20/lJ  $ 


(x+u)ye“ 

0Vx“" 


(2) 


where  i$(.)  is  the  standard  normal  cumulative  distribution  function.  We 
will  use  the  notation  IG(u,0)  or  just  IG,  for  this  distribution. 

The  mean  and  variance  are 

E[X]  -  p;  var(X)  -  y3/0 
and  the  skewness  is  «3  *  2-/ (0/V). 

Since  taking  \x=0  (to  make  E[X]  *  0)  also  makes  var(X)  =  0,  a 
standardized  form  of  the  distribution  has  to  be  obtained  circuitously.  By 
taking  the  variable 

Y  =  X  +  £  (£  <  0) 

so  that  E[Y]  =  p  +  5;  var(Y)  »  p3/0;  <x.j(Y)  *  3/(0/p)  and  then  taking 
\i  «  0  =  -£3  we  have 

E[Y]  *  0;  var(Y)  «  1;  «3(Y)  =  3|£|  . 

The  CDF  of  Y  is 


Fy(y|<x3)  »  $(y(l  +  ^T*)  +  exp(18*3"2)  $|-(y  +  ea^Hl  +  |  «3>  1J  (3] 

Chan  et  al.  (1983)  give  tables  of  F(yj«3>  to  6  decimal  places  for 
y  *  -3. 0(0. 1)5. 9  with  «3  «  0.0(0.1)1.2  and  y  *  -1.5(0. 1)7. 4  with 
a,  -  1.3(0. 1)2.5. 
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The  distribution  of  Y  is  a  3-parameter  inverse  Gaussian  (parameters  p, 

2 

0,  £) ;  a  1-parameter  inverse  Gaussian  obtained  by  taking  0  =  p  and  £  ■  0 
has  been  discussed  by  Vod&  (1973)  (See  also  Iliescu  and  Voda  (1977)).  It 
has  PDF 

fx(x|u)  »  y(2ifx3)”^  exp{-  i  x_1(x-vi)2}  (4) 

An  excellent  surmary  of  properties  of  the  inverse  Gaussian 
distribution,  established  prior  to  1978,  is  provided  in  Folks  and  Chhikara 
(1978). 

An  important  generalization  of  the  inverse  Gaussian  distribution, 
introduced  by  Good  (1953)  and  studied  by  Wise  (1971,  1975)  has  attracted 
considerable  attention  recently,  exemplified  especially  by  an  exhaustive 
and  interesting  monograph  by  Jorgensen  (1982).  He  uses  an  additional 
parameter,  \,  and  defines  the  H3F  as 


<Vx>*X 
2  Kx(/%)T 


xX_1 


exp(-  i  (xx-1  +  t^x)  } 


(x  >  0) 


(5) 


where  K^t.)  a  Bessel  function  of  the  third  kind.  If  X  *  -i  we 

have  an  inverse  Gaussian  distribution  with  parameters  0  *x»  M  -  -/(4/y)  • 
Other  special  cases  are 

X  *  |  -  distribution  of  the  reciprocal  of  an  inverse  Gaussian 
variate. 

X  *  0  ,  X  >  0  -  garrma  distribution, 
and  ^  «  0  ,  X  <  0  -  reciprocal  gamma  distribution. 


For  \  *  0  the  'hyperbolic  distribution'  ( Bar ndorff -Nielsen  (1978), 


Rukhin  (1974))  is  obtained. 

An  alternative  form  of  the  PDF,  more  symmetrical  in  appearance,  is 
used  by  Jorgensen  (1982).  This  is  obtained  by  putting  m  =  /^\p ;  n  = 

/<  %/ip)  leading  to 

fx(x)  -  {?n  (w)}"1  xX_1  exp{-  i  u(t)  -1x  +  n  x"1)}  (6) 

(x  >  0;  at,  n  >  0) 

If  X  =  0  or  4*  *  0,  so  that  a)  *  0  the  IG  distribution  (1)  corresponds  to 
(6)  with  X  *  a)  *  0/p,  n  =  M  (and  K_^  (id)  ■  y/(n  /ZiuT^e-^). 

Hazard  Function.  For  X  <  1  the  hazard  function  is  unimodal  with  zero 
initial  value  and  asymptotic  value  ip/2.  The  mode  m^  of  the  hazard  function 
satisfies 

X(2(i-x))“1  <%<  xd-xr1 

For  x*  0  we  have  a  Gamma  distribution  with  0  <  X  <  1  which  is  a 
degenerate  case  with  the  hazard  function  infinite  at  the  origin  and 
decreases  towards  the  asymptotic  value  ip/2.  Thus  for  the  reciprocal  gamna 
distribution  (i|>  ■  0,  X  <  0)  the  hazard  function  has  zero  asymptotic  value. 
For  X  >  1  the  hazard  function  starts  with  zero  and  increases  towards  the 
asymptotic  value  ip/2.  The  exception  is  the  exponential  case  (X  *  0,  X  * 

1)  when  the  hazard  rate  is  constant  and  equals  i>/2.  (of.  Jorgensen  (1982)) 


to  have  a  standard  normal  distribution. 

J/Srgensen  (1982)  reports  an  approximation  based  on  supposing 

(2X  +  X  to  have  a  standard  normal  distribution,  and  suggests  that 

even  better  approximation  might  be  obtained  by  using  log  X  as  approximately 
normal. 


Estimation  Estimation  of  the  3-parameter  inverse  Gaussian  distribution 
with  PDF 


fY(y|p,0,  )  = 


\  r 

0  i 

. - ?  ■  exp. 

2tt  (y-c>  J 


0(y-C-p)2 

2(y-C)M2 


(9) 


(y  >  C;  0rM  >  0) 


has  been  discussed  by  Padgett  and  Wei  (1979 ),  Cheng  and  Amin  (1981),  Jones 
and  Cheng  (1984),  Chan  et  al.  (1984),  and  Cohen  and  Whitten  (1985). 

Moment  and  maximum  likelihood  estimation  is  discussed  by  Padgett  and 
Wei;  and  maximum  likelihood  estimation  by  Cheng  and  Amin,  with  special 


attention  to  consistency  and  asymptotic  efficiency.  The  two  methods  of 
estimation  are  compared  by  Jones  and  Cheng/  who  find  maximum  likelihood  to 
be  clearly  superior.  Modified  maximum  likelihood  and  moment  estimation,  in 
which  the  equation  obtained  by  equating  expected  and  observed  values  of  the 
first  (least)  order  statistic  replaces  the  maximum  likelihood  equation 
equating  3  log  (likelihood)  /  3£  to  zero,  or  that  equating  sample  and 
population  values  of  the  third  moment  to  zero,  are  discussed  by  Chan  et  al. 
Cohen  and  Whitten  provide  tables  to  assist  in  ocmputation  of  the  solution 
of  the  newly  introduced  equation  for  £  in  terms  of  pi  and  0. 

Maximum  likelihood  estimation  of  the  2-parameter  1G  distribution  is 
discussed  by  Gupta  (1973).  Bayesian  estimation  of  reliability  was 
considered  by  Padgett  (1981),  which  was  recently  generalized  by  Howlader 
(1985). 

The  information  matrix  for  a  mixture  of  two  IG  distributions  has  been 
evaluated  by  Al-Hussaini  and  Ahmad  (1984). 

Genesis  Huff  (1975)  has  given  a  heuristic  derivation  of  the  well-known 
result  that  the  inverse  Gaussian  can  arise  as  the  distribution  of  first 
passage  time  in  Brownian  motion.  Barndorff -Nielsen  et  al.  (1978)  have 
shown  that  generalized  inverse  Gaussian  distribution  with  \  <  0  can  arise 
as  the  distribution  of  first  passage  time  of  a  time  homogeneous  diffusion 
process  x(t)  over  [0,oo)  with  infinitesimal  mean  £(x)  and  variance  ac(x), 


r-  U-x  s*  kx-1  i8(x)/7|  ,  dg(X| 

I28'*1  Ve<x,/p  J  4  dx 


(if  ip  >  0) 


*  S<x(x) 


2\  +  1 
2  e(x) 


,  1  d  <x(x) 
4  d  x 


(if  ip  -  0) 
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The  parameters  of  the  GIG  distribution  are  0  (x(0))  and  ^  •  where 
Jorgensen  (1982)  shews  that  the  GIG  can  be  regarded  as  a  limiting  case  of 
generalized  hyperbolic  distribution. 


Applications  Marcus  (MS  1974-5)  has  suggested  that  the  IG  distribution  be 

used  in  place  of  the  lognormal  distribution  when  longtailed  distributions 

are  expected  to  be  appropriate  -  for  example  in  distribution  of  sizes  of 

particles  in  aggregrates  (Bardsley  (1980)).  If  distributions  of  sums  of 

individual  random  variables  (convolutions)  are  to  be  studied,  it  may  be 

possible  to  use  the  fact  that  if  IG  variables  n  IG(p^,0^)  have  the  same 

2 

value  <j>,  say,  for  the  ratio  of  variance  to  expected  value  (p^/0^)  then 

n  n 

(X^  +  X2  +  ...  +  Xfi)  has  an  IG  ^  p^,  4*  *  (  ^  Pj)  2  distribution. 

i=l  i=l 

IG  distributions  have  been  used  to  represent  duration  of  strikes 

(Lancaster  (1972));  hospital  stays  (Whitmore  (1975,  1976));  labor  turnover 

(Whitmore  (1979));  aggregate  insurance  claims  (Seal  (1978));  lifetime 

(Chhikara  and  Folks  (1977);  and  sentence  length  (Sichel  (1974,  1975)).  A 

critical  conparison  of  the  use  of  lognormal  and  IG  distributions  for 

duration  of  strikes  has  been  given  by  Lawrence  (1984). 

An  application  in  accelerated  life  testing  has  been  described  by 
Bhattacharyya  and  Fries  (1982).  It  is  supposed  that  for  a  given  stress, 
X,  the  conditional  distribution  of  lifetime  is  IG((a +  8H)-1,  *)  • 


Inference  The  statistic  /  *ir  /  *i  1S  rtaniinal  sufficient  for  the 

•  1=1  i=l 

n  n  n 

IG  distribution;  ^  X^,  ^  /  1°9  Xi  is  minimal  sufficient  for 

i=l  £=i  £^i  j 

the  GIG  distribution.  Jorgensen  (1982)  uses  the  notation  X  ,  X_^,  X  for 
the  three  last  quantities,  and  we  will  follow  this  example,  also 
introducing  X  =  n^X,X,=n^X,;X  =  n  ^  X  . 

The  well-known  analogy  between  analysis  of  data  based  on  normal  and  on 
IG  parent  distributions,  (clearly  presented  by  Folks  and  Chhikara  (1978)), 
whereby  the  ML  estimators  of  p  and  0  for  IG  data  are 
p  =  X  ;  0  =  (X  -  X-1)-1  , 

AS 

At.  A 

p  and  0  are  mutually  independent,  p  has  a  IG(p,n0)  distribution  and  n  0/0 
has  a  xn-1  distribution,  does  not  extend  to  generalized  IG  distributions. 
Folks  and  Chhikara  (1978)  draw  attention  to  the  algebraic  formulae 

n  n 

1  0  [i~2  ^  xT1(Xi-p)2  =  i  0  ^  (XT1  -  X_1)  +  i  n0  M~2  )T1(X-p)2 
i=l  i=l 


and 


k  ni 


)  )  f  x7l  -  X_1]=  \  n.  (XT1  -  X"1) 


L .  L.  i -i3 

i=1j=i 


/_  "iv  i. 

i=l 


k  ni 


+  )  )  (xTj  -  x?1) 

1—  i—  1  • 
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where  X.  *  n.  -  )  X..  and  X  =  (\  n.  )“^  ^  n.X. .  These  are 
1.  1  1  j  •*  L-  1  L—  1  x 

j=l  i=1  i=1 

analogs,  for  IG  parent  populations,  to  the  standard  decompositions  of  sums 
of  squares,  used  in  analysis  of  variance  for  data  in  one-way  classification 
by  groups,  for  normal  parent  populations. 

They  also  show  that  for  the  analog  of  a  standardized  normal  variable 
U  =  (ne^CX-pMuX*)"1 

U2  is  distributed  as  \  with  1  degree  of  freedom  but  U  is  not  normally 
distributed.  Similarly,  defining 

w  =  U/{^j  *  6/0}* 

2 

W  is  distributed  as  the  square  of  Student's  t  with  (n-1)  degrees  of 
freedom,  but  W  is  not  distributed  as  Student's  t. 

Folks  and  Chhikara  (1978)  also  provide  results  relevant  to  comparison 
of  populations  with  IG  distributions  and  to  regression  analysis. 

Uniform  minimum  variance  estimation  of  parameters  of  IG  distributions 
is  discussed  by  Iwase  and  Seto  (1983). 


Recent 


on  GIG  Distributions 


Letec  and  Seshadri  (1983)  have  provided  a  characterization  of  GIG 
distributions. 

Embrechts  (1983)  has  shown  that  the  GIG  distribution  (5)  belongs  to  a 
class  of  subexponential  distributions. 
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